TEM virus images: Benchmark dataset and deep learning classification
نویسندگان
چکیده
• We publish a new challenging dataset with 1245 TEM images of 22 virus classes. propose baseline classification for this dataset. Our best model, fine-tuned DenseNet201, achieved 93.1% accuracy on the test set. custom CNN 90.1% despite being 10x smaller than DenseNet201. show importance application knowledge in design and interpretation. To achieve full potential deep learning (DL) models, such as understanding interplay between model (size), training strategy, amount data, researchers developers need access to dedicated image datasets; i.e., annotated collections representing real-world problems all their variations, complexity, limitations, noise. Here, we present, describe make freely available an transmission electron microscopy (TEM) It constitutes interesting challenge many practical applications virology epidemiology; e.g., detection, segmentation, classification, novelty detection. also present benchmarking results detection recognition using some top-performing (large small) networks well handcrafted very small network. compare evaluate transfer from scratch hypothesizing that limited dataset, is crucial good performance large network whereas our performs relatively when scratch. This one step towards how much data needed given task. The benchmark contains representative split into training, validation, sets Moreover, different established DL solution classifying subset 14 most-represented classes DenseNet201 pre-trained ImageNet set, 0.921 F1-score proposed Public real biomedical datasets are important contribution necessity increase shortcomings, requirements, improvements solutions or deploying clinical settings. compared hypothesize limited-sized achieving models. Last but not least, demonstrate creating models analyzing results.
منابع مشابه
EuroSAT: A Novel Dataset and Deep Learning Benchmark for Land Use and Land Cover Classification
In this paper, we address the challenge of land use and land cover classification using remote sensing satellite images. For this challenging task, we use the openly and freely accessible Sentinel-2 satellite images provided within the scope of the Earth observation program Copernicus. The key contributions are as follows. We present a novel dataset based on satellite images covering 13 differe...
متن کاملA Benchmark Dataset for Audio Classification and Clustering
We present a freely available benchmark dataset for audio classification and clustering. This dataset consists of 10 seconds samples of 1886 songs obtained from the Garageband site. Beside the audio clips themselves, textual meta data is provided for the individual songs. The songs are classified into 9 genres. In addition to the genre information, our dataset also consists of 24 hierarchical c...
متن کاملA Benchmark Dataset to Study the Representation of Food Images
It is well-known that people love food. However, an insane diet can cause problems in the general health of the people. Since health is strictly linked to the diet, advanced computer vision tools to recognize food images (e.g. acquired with mobile/wearable cameras), as well as their properties (e.g., calories), can help the diet monitoring by providing useful information to the experts (e.g., n...
متن کاملClassification of Chest Radiology Images in Order to Identify Patients with COVID-19 Using Deep Learning Techniques
Background and Aim: Due to the important role of radiological images for identifying patients with COVID-19, creating a model based on deep learning methods was the main objective of this study. Materials and Methods: 15,153 available chest images of normal, COVID-19, and pneumonia individuals which were in the Kaggle data repository was used as dataset of this research. Data preprocessing inc...
متن کاملObject Classification in Images of Neoclassical Artifacts Using Deep Learning
The transformation of aesthetic styles has been at the heart of art history since its inception as a scholarly discipline in the late eighteenth century. Analyzing the single artifact and the carefully curated corpus have been the techniques for crafting hermeneutic understanding for such processes of change. Recently new instruments based on statistical techniques empower us for a fresh take o...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Computer Methods and Programs in Biomedicine
سال: 2021
ISSN: ['1872-7565', '0169-2607']
DOI: https://doi.org/10.1016/j.cmpb.2021.106318